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Abstract — We consider the problem of cooperation in dis- 
tributed wireless networks of selfish and free transmitters aiming 
at maximizing their energy-efficiency. The strategy of each 
transmitter consists in choosing his power control (PC) policy. 
Two scenarios are considered: the case where transmitters can 
update their power levels within time intervals less than the 
channel coherence time (fast PC) and the case where it is updated 
only once per time interval (slow PC). One of our objectives is 
to show how cooperation can be stimulated without assuming 
cooperation links between the transmitters but only by repeating 
the corresponding PC game and by signals from the receiver. In 
order to design efficient PC policies, standard and stochastic 
repeated games are respectively exploited to analyze the fast 
and slow PC problems. In the first case a cooperation plan 
between transmitters, that is both efficient and relies on mild 
information assumptions, is proposed. In the second case, the 
region of equilibrium utilities is derived from very recent and 
powerful results in game theory. 

I. Introduction 

In the wireless literature, when it is referred to cooperative 
networks, this generally means that some nodes in the network 
act as relays in order to help other nodes (the sources or trans- 
mitters) to better communicate with their respective destination 
or receiver nodes. This idea has been formalized in information 
theory in |1||2| for the relay channel, for the cooperative 
multiple access channel (MAC) (|3], and for other types of 
cooperative channels during the last decade (lUllSl etc). The 
vast majority of these papers address centralized networks 
and cooperation between nodes is based on the existence of 
physical links between some nodes. In the present paper we 
consider the case of decentralized or distributed networks with 
implicit cooperation. By decentralized/distributed, we mean 
that the nodes are assumed to be free decision makers who 
decide by themselves what is good for them and can ignore 
possible recommandations from central nodes (namely the 
power control policy in our case). By implicit, we mean that 
nodes cooperate without using dedicated cooperation channels 
between nodes. To be more concrete, we consider multiple 
access channels where no link between the transmitters is as- 
sumed and transmitters are modeled by selfish players aiming 
at maximizing the energy-efficiency of their communication. 
A very simple and pragmatic way of knowing to what extent 
a communication is energy-efficient has been proposed by 
ISHll. The authors of these articles define energy-efficiency 
as the net number of information bits that are transmitted 



without error per time unit (goodput) to the transmit power 
level. More specifically, the authors analyze the problem of 
distributed power control (PC) in flat fading multiple access 
channels. The problem is formulated as a non-cooperative 
one-shot game where the players are the transmitters, the 
strategy of a given player is his transmit power for a given 
channel realization, and his payoff/reward/utility function is 
energy-efficiency of his communication with the receiver 
Unfortunately, the Nash equilibrium (NE) resulting from this 
game is generally inefficient. 

In the papers on energy-efficient power control cited above 
and related papers (e.g., |l8l||9]), the used game-theoretic 
framework is the one of static or one-shot games for which 
transmitters are assumed to interact once per block and from 
block to block in an independent manner; the block duration 
is assumed to be less than the channel coherence time. In 
practice, there will be some scenarios where transmitters can 
update their power level several times within a block or/and are 
active over several and possibly many blocks. In game theory, 
it is well known that this feature can change the behavior 
of the players and incite them to cooperate while staying 
selfish |10|. The corresponding game-theoretic framework is 
then the one of dynamic games. In this paper we propose to 
model the distributed energy-efficient power control problem 
by exploiting two types of repeated games (RG), namely 
the standard and stochastic RG, which are special cases of 
dynamic games. In standard RG ITTI . the same game is 
repeated a certain number of times. In stochastic RG lfT2ll . 
players' utilities depend on a certain state (or parameters) 
which vary over time according to a stochastic process. We 
use standard RG to analyze scenarios where transmitters can 
update their powers several times within each block (the same 
game is therefore repeated within a block) and stochastic RG 
for scenarios where transmitters update their powers once per 
block (the game is therefore parameterized by the channel state 
and is repeated from block to block). We will respectively refer 
to these scenarios as fast and slow power control (FPC, SPC). 

The contributions of this paper are as follows: 1. The 
framework of repeated games is applied for the first time to the 
distributed energy-efficient power control problem; 2. In the 
case of FPC, we derive equilibrium PC strategies which are 
based on a cooperation plan between the transmitters, Pareto- 
efficient, and only require individual channel state information 



(CSI) at the transmitters and a public signal to be implemented; 
3. In the case of SPC, which is much more difficult to treat 
properly, only the set of possible equilibrium utilities of the 
stochastic RG (which can be seen as a counterpart of a capacity 
region of a distributed channel when Shannon transmission 
rate is considered for the utilities) is derived by exploiting a 
very recent result in game theory derived by Horner et al. lfT3l 
and Fudenberg and Yamamoto (T^\. To achieve this utility 
region, global CSI and a public signal are assumed at the 
transmitters. The determination of the equilibrium strategies 
is left as a non-trivial extension of this work. 

II. Signal model 

We consider a distributed MAC with a finite number of 
users, which is denoted by K. The network is said to be 
distributed in the sense that the receiver (e.g., a base station) 
does not dictate to the transmitters (e.g., mobile stations) 
their PC policy. Rather, all the transmitters chooses their 
policy by themselves and want to selfishly maximize their 
energy-efficiency; in particular they can ignore some specified 
centralized policies. We assume that the users transmit their 
data over block flat fading channels. The equivalent baseband 
signal received by the base station can be written as 
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with i € /C, ^ = {!,..., K}, Elxjp = z C7V(0,o-2). 
Each channel gain gi varies over time following a Markov 
chain and is assumed to be constant over each block. For 
each transmitter i, the channel gain gi is assumed to lie in 
a discrete set (e.g., because channel chains are quantized). 
The notation rji — \gif', with 77^ £ Fi, |Fi| < +cxd, will be 
used. For the transmit power levels pi they will be assumed 
to lie in a compact set Vi — [Q,P™^^] in Sec. Hill and in a 
discrete set = {P} , P^') , with P^'' = P^'''', in Sec. 
II VI The discrete set assumption is suited to exploit the results 
of lfT3llfT4l without introducing additional technicalities we 
wanted to avoid in this (relatively short) paper. At last, the 
receiver is assumed to implement single-user decoding. 

III. Fast power control and standard repeated 

GAMES 

A. Review of the one-shot power conrol game 

Here we review a few key results from Q concerning the 
static PC game. We denote by Ri the transmission information 
rate (in bps) for user i and / an efficiency function represent- 
ing the block success rate, which is assumed to be sigmoidal 
and identical for all the users. For a given block, the signal-to- 
interference plus noise ratio (SINR) at receiver i G {!,..., K} 
is denoted by SINR^ and writes as: 



SINR, 



(2) 



where Pi e [0,P™'^'']. With these notations, the static PC 
game, denoted by Q, is defined in its normal form as follows. 



Definition 1 (Static PC game): The static PC game is a 
triplet Q = {K.,{Vi}ieK,{'^i}ieK.) where K = {!,..., i^} is 
the set of players, Vi, ■■■,Vk ore the corresponding discrete 
sets of strategies, Vi = {0, . . . , P^^^}, P^^^ is the maximum 
transmit power for player i, and Ui, are the utilities of 

the different players which are defined by: 

i?J(SINR,) 



Ui{pi, ...,pk) = 



Pi 



[bit/ J]. 



(3) 



We suppose from now that the above description of the game 
is common knowledge and the players are rational (every 
player does the best for himself and knows the others do so 
and so on). An important game solution concept is the Nash 
equilibrium (i.e., a point from which no player has interest in 
unilaterally deviating). When it exists, the non-saturated Nash 
equilibrium of this game is given by 

-2 /3* 



(4) 



where [3* is the unique solution of the equation xf'{x) — 
f{x) = 0. By using the term "non-saturated NE" we mean that 
the maximum transmit power for each user, denoted by p™^^, 
is assumed to be sufficiently high for not being reached at the 
equilibrium i.e., each user maximizes his energy-efficiency for 
a value less than p™^^ (see |9| for more technical details about 
this assumption). An important property of the NE given by (|4]i 
is that transmitters only need to know their individual channel 
gain (i.e., gi) to play their equilibrium strategy. One of the 
interesting results we want to prove is that it is possible to 
obtain a more efficient equilibrium point when transmitters 
can update their powers several times per block while keeping 
this key information property of individual CSI. 

B. The discounted repeated power control game 

In this section, we assume that the transmitters can update 
their powers within time intervals less than the channel coher- 
ence time. The instants at which the transmitters update their 
powers are called game stages. Therefore, for each channel 
realization g = {gi, ...^gK) a given repeated game is played. 
As mentioned in Sec. |I1 the fact that the PC game is repeated 
induces new behaviors (namely cooperative behaviors) for 
the transmitters. Because of repetition, selfish but efficient 
agreements between transmitters are possible. In this work, 
we propose an operating point (OP) of the one-shot PC game 
which can serve as a part of a cooperation plan between the 
transmitters. Before defining the repeated power control game, 
let define the proposed OP. 

By considering all the points such that pi G 

[Q^P™^^], i G /C, one obtains the feasible utility region. We 
consider a subset of points of this region for which the power 
profiles (pi, ...,pk) verify p^lg^p = P]\g3? for all (i, e lO . 
Such a subset is made of the following system of equations: 

e 1C\ -^{p) = with p,\g,f = p,\g,f. (5) 

It turns out that, following the lines of the proof of SE 
uniqueness in 1 9 1, it is easy to show that a sufficient condition 



for ensuring both existence and uniqueness of the solution 
to this system of equations is that there exists xq e]0, -j^zrii 

such that jTj^ — i-^(ji^i)x strictly positive on ]0, a;o[ 
and strictly negative on ]xo, -/^3x[- It is satisfied for the two 
efficiency functions the authors are aware of, which are: 



Hill with c = 2'^ - 1 



fix) = 161 and /(a 

(R is the transmission rate). Under the aforementioned condi- 
tion, the unique solution of (|5]l can be checked to be: 
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Vi e /C, p. 
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(6) 



r]i 1- {K ~ 1)7 

where 7* is the unique solution of x[l — [K — 1) • x]f'{x) — 
fix) = 0. The proposed OP, given by (|6]), is thus fair in the 
sense of the SINR since V« G /C, SINR^ = 7*. We are going to 
exploit this point of the one-shot PC game to build equilibrium 
strategies of the DRG. 

Let us define a strategy for the discounted repeated game. 
The transmitters are assumed to receive a public signal s(t) 
after playing at game stage t and keep this in memory. This 
public signal is linked to the actions of the transmitters by an 
observation function <j) : Vi x ... x Vk — > <S. 

Definition 2 (Players' strategies in the RG): A pure strat- 
egy for player i G IC is a sequence of causal functions {Tit)^^^ 
with 

■Ht ^ [0,P, 



Ti,t 



Mt) 



(7) 



where t is the game stage index, = (s(l), s(t — 1)) is 
the game history vector and Ht = S*~^. 
The strategy of player i, which is a sequence of functions, will 
be denoted by t^. The vector of strategies t = {ti,...,tk) 
will be referred to a joint strategy. A joint strategy r induces 
in a natural way a unique action plan {p{t))t>i. To each 
profile of powers p{t) corresponds a certain instantaneous 
utility Ui{p{t)) for player i. In our setup, each player does 
not care about what he gets at a given stage but what he 
gets over the whole duration of the game. This is why we 
consider a utility function resulting from averaging over the 
instantaneous utility. 

Definition 3 (Players' utilities in the RG): Let 
T — {ti,...,tk) be a joint strategy. The utility for player 
i €z IC is defined by: 



vHz) 



\(\^Xf-^u,{p(t)) 



(8) 



where p(t) is the power profile of the action plan induced by 
the joint strategy t and < X < 1 is a parameter of the DRG 
called the discount factor and is known to every player (since 
the game is with complete information). 
In the current available wireless literature on the problem 
under investigation discounted repeated games (DRG) are used 
as follows: in ifTSI the discount factor is used as a way of 
accounting for the delay sensitivity of the network; in 1 17| the 
discount factor is used to let the transmitters the possibility to 
value short-term and long-term gains differently. Interestingly, 
lfmilT2 l offers another interpretation of this model. Indeed, the 



author sees the DRG as a finite RG where the number of game 
duration would be unknown to the players and considered as 
an integer-valued random variable, finite almost surely, whose 
law is known by the players. Otherwise said, A can be seen 
as the stopping probability at each game stage: the probability 
that the game stops at stage t is thus A(l — A)*^^. The function 
vl would correspond to an expected utility given the law of 
the game duration. This shows that the discount factor is also 
useful to study wireless games where a player enters/leaves 
the game. 

Theorem 4 (Equilibrium strategies in the DRG): Assume 
that the following condition is met: 

1 - (if - 1)7* /(7*) l-{K~l)p* 



A < 



(9) 



> 1, T,t 



(if -1)7* /(/?*) {K-l)p* 

Then, for all i lE JC, the following action plan is a subgame 
perfect NE of the DRG for any distribution for the channel 
gains: 

pf^ if the other players play p-i 
p* otherwise 

(10) 

The proof of this theorem is not provided here; the main 
idea of the proof is to derive a sufficient condition on the 
discount factor such that the maximum gain induced by 
a unilateral deviation is less than the loss induced by the 
punishment procedure that the other transmitters apply by 
playing at the one-shot game NE. The proposed cooperation 
plan therefore consists in playing at the operating point if no 
transmitter deviates from this point. If one transmitter deviates 
from the OP, then all the other transmitters play the action 
corresponding to one-shot game NE. At this point it is possible 
to see very clearly the information assumptions needed to 
implement the proposed distributed power control policies. 
To play at pf^ or p* only the individual CSI (rji — Ig^p) 
is needed by each transmitter To detect the deviation of a 
transmitter we propose the following mechanism: the receiver 
broadcasts the pubUc signal s{t) = ''liit)Pi{t) G S 

(note that the knowledge of the individual SINR is a sufficient 
condition to re-construct this public signal). At the OP, this 
signal equals ■ Thus, if one transmitter deviates 

all the other transmitters detect this unilateral deviation and 
can therefore stop cooperating and start playing the one-shot 
game NE. Interestingly, the proposed equilibrium strategies 
have been found to be Pareto-optimal for all simulations we 
have performed. As a result, the proposed PC policies are both 
efficient and rely on reasonable information assumptions. For 
comparison, the policies based on pricing ifTsl require global 
CSI. 

IV. Slow power control and stochastic 

DISCOUNTED REPEATED GAMES 

From now on, we consider a more general scenario in which 
channel gains ?/ can vary from game stage to game stage. 
The utility function at a given game stage therefore depends 
not only on the profile of actions p{t) played at stage t but 
also on the vector of channel gains g{t) = {gi{t), ...,gK{t)) 



and more precisely on rj{t) — {r]i{t), ...,riK{t)) G T, with 
r = Fi X ... X Tk- The corresponding game-theoretic frame- 
work is the one of stochastic repeated games. Our objective is 
to characterize the set of equilibrium utilities of the repeated 
game. This can be thought of as a counterpart of a capacity 
region in information theory. The corresponding result is called 
a Folk theorem. It turns out that no general Folk theorem 
is available for stochastic RG. It is only very recently that 
some authors 113|[14| succeeded to derive a Folk theorem for 
stochastic RG with public information. To be able to exploit 
these very interesting results we assume that every transmitter 
knows the public signal s e iS as in the previous section and 
has global CS1 1] at each game instance or stage. An important 
condition which is assumed to be satisfied by the channel gain 
process is the irreducibility property. 

Definition 5: Let rj and rj' be two channel states and 
7r(?7'|?7), the probability that the next state will be rj' knowing 
that the actual state is rj. The transition probability tt is 
irreducible if for any channel states rj and t]', we have 
n{rf\ri) > 0. 

The mobility in wireless communication impose that for a 
given channel reaUzation, there is always a positive probability 
for each channel gain to be drawn in the next stage. In order to 
characterize the set of equilibrium payoff, we assume that the 
transition probability is irreducible. As in the previous section, 
we assume that the player does not observe perfectly the 
actions played by the other player in the past stages (imperfect 
monitoring) but have only access to the public signal s{t) E S. 

A. The game course 

The game starts at stage t ~ 1 with an initial state g(l) 
which is known by the players. The transmitters simulta- 
neously choose a power level p(l) = (pi(l), . . . (1)) 
and get a public signal s(l) g S from 4>{p{l)). The stage 
utility, denoted by Uj(p(l), 77(1)) is not known by the player 
i. After the stage t — 1, the channel states are drawn from 
the probability distribution TT{-\ri{t — 1)) e A(r) and the 
realization is publicly announced : 77 (i) — . . . ,r]K{t)). 

Taking into account the past history of the game, the players 
choose simultaneously their action and get a public signal 
s{t) e S from and does not know their stage utility 

Ui{p{t),'q{t)), and so on. We define the vector of private h^{t) 
and public hit) history of player i : 

hiit) = (p^(l),s{l),r|(l),...,p,{t-l),s(t-l),2{t-l),r|{t)) 
hit) = (s(l),27(l),...,s(t-l),5(t-l),r,(t)) 

We define the public history of the game as the intersection 
of all private histories. Note that the private history con- 
tains the pubUc one and the sequence of transmission power 
{Pi{t))T-i>t>i of player i. The vector h{t) lies in the set 



Ht 



{S X F)*"^ X F 



(11) 



where the notation (.)*^^ refer to the Cartesian product of 
sets. This vector (fTTl l that is assumed to be known by each 
transmitters before playing for block t. The private and public 
histories are introduced in order to define the private and the 
public strategies. In the sequel we will restrict ourself only to 



the public strategies for which it is possible to characterize the 
set of equilibrium utilities. Note that this restriction does not 
affect the final result in terms of set of equilibrium utilities. In 
fact, we show that, in our framework, the players should not 
take into account their private history. A strategy is a sequence 
of functions from the history of the game onto a probability 
distribution over the set of power. 

Definition 6 (Players' strategies in the RG): A public 
strategy for player i E JC is a sequence of functions ('?i,t)t>i 
with 



Tit 



(12) 



Where A{Vi) denote the set of probability over Vi- 
The public strategy of player i will therefore be denoted by 
fi while the vector of public strategies f — (fi, ...,f/f) will 
be referred to a joint public strategy. A joint public strategy t 
induce in a natural way a unique probability Pf over the set 
of action plans (p{t))t>i and sequence of signals {s{t))t>i- 
The averaged utility for player i can then be defined as follows. 

Definition 7 (Players' utilities in the RG): Let 
T ~ (fi, fx) be a joint mixed strategy. The utility 
for player i E JC if the initial channel state is r]{l), is defined 
by: 

v^{L9) = - A)*-iE^,^ [uMt),vmva)] (13) 

t<l 

where (p(t))t>i is the sequence of power profile induced by 
the joint strategy f. 

We present now the proper definition of a stochastic repeated 
game. 

Definition 8 (Stochastic RG with Public Monitoring): A 
stochastic repeated game with public monitoring is defined 
as Q — (/C, (7i)i, {vi)i, (Fi)i, TT, iS, $), where K, is the set of 
players, % is the set of strategy of player i, Vi, her long-term 
utility function, tt is the transition probability over the set 
of channels gains {rji)i, $ is the public observation function 
and S is the set of public signals. 

We suppose from now that the above description of the game is 
common knowledge and the players are rational (every player 
does the best for himself and knows the others do so and so 
on). 

B. Equilibrium concept 

At this point, public Nash equilibrium strategies of the 
stochastic repeated game starting with the channel state g can 
be defined. 

Definition 9 (Public Equilibrium Strategies of the RG): 
A public mixed strategy r supports an equilibrium of the 
stochastic repeated game with initial channel state 77(1) if 

e JC,Wi ULvW) > v^{riz-^,vil)) (14) 

where —i is the standard notation to refer to the set /C\{i}; 
here T_j = (fi, fj_i, f^+i, f/f ). 



The notion of Nash equiHbrium in repeated game is refined by 
the sub-game perfection property, introduced by Sehen for ex- 
tensive games [19 |, |,20j . For a sub-game perfect equiHbrium, 
the incentives hold along the duration of the game. 

Definition 10 (Perfect Public Equilibrium Stmt, of the RG): 
A public strategy profile r is a perfect public equilibrium if 
for every /i* €E Ht, the continuation profile r|^t is a Nash 
equilibrium of the restricted stochastic repeated game starting 
with the channel state g{t). We denote E\{'>](1)) the set 
of Perfect public equilibrium of the game with initial state 
7?(1) G r and discount factor X. 

An important issue is precisely to characterize the set of possi- 
ble equilibrium payoff or public perfect equilibrium payoff in 
the repeated game. This kind of result often appears as "Folk 
Theorem" (see e.g.,| 10|| 11 1). A huge part of the literature 
is dedicated to find the set of equilibria under different 
assumptions, but a general characterization is still unavailable. 
Our model is included in the framework of stochastic repeated 
game with imperfect public monitoring. 

C. Independence of the initial State 

In classical models of stochastic repeated game, the initial 
state 77(1) could be determinant for characterizing the solutions 
of our problem. However, it is natural to think that the initial 
state of channel gain will not influence the future sequence 
of channel realization. We present some results of Dutta 
(1995) II2TII that formahze the above statement. Because of 
the irreducibility property of the channel stochastic process, 
the limit set of feasible utilities, the set of perfect public 
equilibrium utilities and the minmax utilities are independent 
of the initial state. 

Theorem 11 (Independence of the Initial State): Suppose 
that the stochastic repeated game is irreducible (|5]), it implies 
fliat : 

• The limit of the minmax is independent of the initial state 

i.e. YiTcix ^ominr_imaxTiVi{Ti,T_^,ri{l)) — Vi for all 

ry(l) and all i d K. 

• The limit set of feasible utilities is independent of the 
initial state i.e. Unix >o F\{vi^)) = F for all ri{l). 

• The limit set of public perfect equilibrium utilities is in- 
dependent of the initial state i.e. lim^ >.o E\{'q{\)) — E 

for all 77(1). 

The following definition is fundamental for characterize the 
set of public perfect equilibrium payoff of our repeated game. 

Definition 12: We define the set of asymptotic feasible and 
individually rational payoff by: 

F* ^ {x e F\xi > i„ yi e K} (15) 

The set F* is defined as the set of energy-efficiency utilities 
the players can get such that each of them has more than his 
minmax utility. 

D. Main Result : Folk Theorem 

The following theorem state that only a condition over the 
discount factor A is sufficient to have a sub-game perfect 
equilibrium property for a utility vector u in F*. 



Theorem 13: For each utility vector u £ F*, there exists a 
Ao such that for all A < Aq, there exists is a perfect public 
equilibrium strategy of our stochastic repeated power control 
game, such that the long-term utility equals u^F*. 
The proof is based on Horner, Sugaya, Takahashi and Vieille 
(2009), 113] ; Kandori and Matsushima (1998) [22|. 

V. Numerical Illustration of Optimal 
Equilibrium Utilities 

The above result implies that each Pareto-optimal utility 
vector that is individually rational can be sustained by a public 
perfect equilibrium strategy for a discount factor sufficiently 
small. In practice, we have to focus on a particular Pareto- 
optimal point which is individually rational. For example, 
denote by p the solution of the maximization problem : 
maxpgp s.t. ui>viyieK Y^i^K '^i'^iip) and u it's corresponding 
utility vector. The above theorem states that u is a public 
perfect equilibrium utility of the A— discounted repeated game 
for a sufficiently small discount factor if u Pareto-dominates 
the Minmax utilities. In the whole section we consider the 
same type of scenarios as |8||9| namely random code division 
multiple access systems with a spreading factor equal to N 
and the efficiency function f{x) = (1 — e^^)^^, M being the 
block length. 

□ We consider a simple stochastic process with two channel 
states: (771,772) G {(7, 1), (1, 7)}. The transition probability 
is constant over the channel states: n{-) = (i, i) and its 
invariant measure is /i = (5,5). Consider the scenario 
(K, M, N) = (2, 2, 2). Fig. [T]represents the achievable utihty 
region for the two different channel state and the long-term 
expected utility region. The positive orthan denotes the set 
of expected individually rational utilities. Its intersection 
with the expected achievable utility region describes the 
set of public perfect equilibrium utility. Three important 
points are highlighted in the different scenario: the expected 
Nash equilibrium of the one-shot game studied in jTl, the 
expected operating/cooperation point studied by Le Treust 
and Lasaulce 2010 L23|, and the point where the expected 
social welfare (sum of utilities) is maximized (star). From 
this figure it can be seen that: a significant gain can be 
obtained by using a model of repeated games instead of the 
one-shot model. Moreover, significant improvement in term 
of expected utilities is a direct consequence of the full CSI 
instead of individual CSI. 

□ As a second type of numerical results, the performance 
gain brought by the stochastic discounted repeated game 
(SDRG) formulation of the distributed PC problem is 
assessed. Considering a simple stochastic process where 
77; = 2 and r/j = 1 for all j G and the i's player 
is drawn with uniform distribution over the K players. We 
compute the expected social utility the players get at the 
social optimum wsdrg- Denote hy wne (resp. wdrg and 
wsdrg) the efficiency of the NE (resp. DRG and SDRG 
equilibrium) in terms of social welfare i.e. the sum of utilities 
of the players. Fig. |2] represents the quantity '^'^-"^'^^"'"-^ 
and "'"-"^^^"^ in percentage as a function of the spectral 



efficiency a = f with iV 128 and 2 < X < |^ + 1. 
The asymptotes a^ax ~ + indicated by dotted 

lines for different values M g {10, 100}. The improvement 
becomes very significant when the system load is close to 
jf + jT, this is because the power at the one-shot game NE 
becomes large when the system becomes more and more 
loaded. As explained in ||9l for the Stackelberg approach 
these gains are in fact limited by the maximum transmit power. 

VI. Conclusion 

Repeating a power control game is a way of introducing 
cooperation between selfish transmitters. In this paper, we 
have shown that the corresponding cooperative power control 
policies can be implemented without using explicit cooper- 
ation channels between the transmitters. In the case of fast 
PC, only individual CSI and a realistic public signal are 
required to implement the proposed schemes. In the case of 
slow power control, only the feasible utility region has been 
derived, the equilibrium power control strategies to achieve the 
corresponding points still need to be found. Both in the cases 
of fast and slow power control, the cooperation gain induced 
by the underlying cooperation plans is shown to be signifi- 
cant. The repeated game formulation of the distributed power 
control problem therefore shows a way of reaching interesting 
trade-offs in terms of global network energy-efficiency and 
signalling. 
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Fig. 2. The gain brouglit by the repeated-game based cooperation in terms 
of the sum of utilities (standard and stochastic repeated games) w.rt. to the 
purely non-cooperative scenario (Nash Equilibrium). 



